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ABSTRACT 



In the absence of spectra, fitting template model spectra to observed photo- 
metric fiuxes, known as Spectral Energy Distribution (SED) fitting, has become 
the workhorse for identifying high-z galaxies. In this paper, we present an analysis 
of the most recent and possibly most distant galaxies discovered in the Hubble 
^_^ Ultra Deep Field using a more robust method of redshift estimation based on 

Q Markov Chain Monte Carlo fitting (MCMC), rather than relying on the redshift 

O of "best fit" models obtained using common x^ minimization techniques. The 

^ advantage of MCMC fitting is the ability to accurately estimate the probability 

I density function of the redshift, as well as any other input model parameters, 

tL allowing us to derive accurate credible intervals by properly marginalizing over 

c/3 all other input model parameters. We apply our method to 13 recently identified 

I— I sources and show that, despite claims based on x^ minimization, none of these 

,-H sources can be securely ruled out as low redshift interlopers given the low signal- 

^ to-noise of currently available observations. We estimate that there is an average 

0> probability of 21% that these sources are low redshift interlopers. 

in 

• Subject headings: Methods: statistical. Galaxies: high- redshift, Calaxies: pho- 

(^ tometry. Galaxies: stellar content 

cn 



1. Introduction 

Over the last 15 years the ever increasing pace at which new, more sensitive detectors and 
larger telescope apertures have come online has spurred a fast and furious race to detect the 
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most distant galaxies in the Universe. Emission features from HIT regions (such as Lyman- 
a), have been a useful tool for detecting star- forming galaxies at 3 < z < 6 (corresponding 



to rest-frame optical wavelengths) (Malhotra et al. 2001). However, at higher redshifts, cor- 
responding to the first several hundred million years since the Big Bang, more and more 
matter was locked up in neutral hydrogen. At these early epochs the neutral intergalactic 
medium (IGM) attenuates significant amounts of light from galaxies, including Lyman-a, 
making detection of these systems more difficult. If Lyman-a is significantly diminished then 
one must rely on the detection of a continuum break to select candidate galaxies. Detecting 
galaxies at these redshifts places constraints on the epoch of re-ionization, thought to be at 
z>6 ( Fan et aL]|2006 Komatsu et al.pOll ). Detecting and measuring the properties of these 
galaxies is critical to understand what caused the epoch of re-ionization. 

Detecting and measuring the properties of high-z galaxies is not easy. At rest-frame 
A <1216A the fiux from high redshift galaxies can be completely vanquished due to the 
Lyman-a forest absorption and when observed in a filter corresponding to this wavelength 
the galaxy "drops out." At A > 1216A the fiux is not attenuated and can be detected. Such 
breaks become more pronounced with redshift, for example at z > 5.7 a break corresponds 
to ~ 3.4 mags in color (e.g. Songaila & Cowie 2002 Hu et al. 2004). Detection at z > 
8 is further complicated by the fact that the Lyman regime is redshifted into near-infrared 
wavelengths (A > 1 yum) where bright atmospheric telluric skylines and strong telluric ab- 
sorption features force ground-based observations to use specific and restrictive atmospheric 



windows. Large HST based spectroscopic surveys such as GRAPES and PEARS (Pirzkal 



et al. 2004) have demonstrated the power of low resolution spectroscopy to identify high 



redshift sources (Malhotra et al. 2005; Rhoads et al. 2009). 



Recently, the race to find the earliest galaxies has relied more and more on the use of the 
near-infrared Wide Field Camera 3 (WFC3) detector (1 < A < L6 /im) on the Hubble Space 
Telescope (HST), and in particular, broad-band imaging. Detecting possible 1216A breaks 
with HST photometry should be straightforward, assuming one collects a sufficient amount 
of photons. However, observations must first contend with the possibility of foreground in- 
terlopers. At z ~ 6 foreground interlopers can be as frequent as 45% for observations with 
signal-to-noise (S/N) < 5 (e.g. [Dickinson et al.||2004t [Stanway et al.||2008| )). Without the 
direct detection of spectroscopic lines, redshift confirmations must rely on the application of 
SED fitting to measured photometric fiuxes. 

One of the first tools used to derive photometric redshifts was the SED fitting code 
hyperz which simply maximizes the likelihood by brute-force on a grid with standard SED 



templates (Bolzonella et al. 2000). As noted in Bolzonella et al. (2000): "Both the 2:phot and 
the SED are obtained through hyperz, together with the best fit parameters (Ay, spectral 
type, metallicity and age). Because of the degeneracy between these parameters, the relevant 
information shall be the redshift and the rough SED type, in the sense that a given object 



has a "blue" or "red" continuum at a given z, but no reliable information can be obtained 
about the other parameters from broad-band photometry alone." Despite of this, hyperz 



and other similar x^ minimization SED fitting codes (e.g. Thompson et al. 2001 Papovich 
eraI][200T| |Labbe et alT][2003l [Schaerer fc Pello |[2005| [Mobasher eFaT] [20051 [Pirzkal et a~ 
2007 Mclure et al.||2009 ) have been used to extract more than just redshifts, including these 
degenerate properties. All of the latter is based on simply selecting the template with the 
smallest x^. 

This has led to a number of situations in which photometric redshifts and galaxy prop- 
erties determined from y^ minimization SED fitting have produced questionable claims of 
high-z galaxies. Pello et al. (2004) claimed the discovery of a z = 10 galaxy from a combina- 



tion of deep J, H, K photometry obtained from the VLT, HST optical imaging, and J-band 



spectroscopy from ISAAC/VLT. While Pello et al. (2004) claimed a detection of Lyman-a 



from low S/N ISAAC/VLT spectra, the basis for the claim relies primarily on SED fitting 
from the application of hyperz to stacked J, H, K imaging. Followup deep F-band ( Vab 



27.8 mag) imaging from the VLT (Lehnert et al. 2005) and deeper i/-band imaging from 



Gemini/NIRI (Bremer et al. 2004) failed to detect fluxes predicted from the SED fitting. 
Lehnert et al. ( [2005 ) and a re-analysis of the spectra by Weatherley, et al. (2004) led to 
the conclusion that the object was a spurious detection, even though the fits from hyperz 



produced an excellent x fit. Mobasher et al. (2005) claimed a strong Balmer photometric 



break between the i/-band and Spitzer IRACl (3.6 /im) resulting in a z = 6.5 claim for 
an early-type quiescent galaxy and raising serious questions regarding the viability of the 
A-CDM paradigm of building bigger galaxies from smaller building blocks. The redshift 



claim was based solely on a better x^ fit to Bruzual & Chariot (2003) models (CB03) in 
lieu of a dusty starburst model at z = 2.5 (the best fit from a secondary x^ minimum). 
When combined with followup Spitzer IRS data at 16 and 22 /xm, revised x^ minimization 
SED fitting produced a best fit for a z = 1.7 luminous infrared galaxy (Chary et al. 2007). 
Similarly, Henry et al. (2008) claimed the discovery of a Lyman break galaxy at z ~ 9 
from NICMOS/i75T imaging (based on a FiiOVF dropout) and Spitzer imaging. Their x^ 
minimization fitting rejected local minima models at z ~ 2-3 (produced by the degeneracy 
among the parameters). Subsequently, it was observations from MMT/Megacam blueward 
of the claimed break (z =26 mag) that demonstrated the intermediate redshift solutions 



that were initially rejected were actually correct (Henry et al. 2009). Finally, Capak et al. 



(2011) presented three z-band dropout candidates in which x^ minimization fitting produced 



best solutions at z > 7, only to find that followup imaging of one blueward of the claimed 
break and at 24 /xm better supported at z ~ 1.7 solution; and a second with observed i/-band 
flux (from NIRSPEC/Keck spectroscopy) weaker than predicted by SED models. 

One should of course always keep in mind that any SED fitting effort is further compli- 
cated by the assumption that model galaxy spectra constructed from our knowledge of how 



stars evolve in the local Universe is equally applicable to the earliest epochs. One should 
also be cognizant of the fact that the rest-frame wavelength range sampled by a fixed set of 
photometric filters will span a narrower range with increasing redshift. For the most recent 



claims of galaxies at 8 < z < 12 (e.g. Yan et al. 2010 Bouwens et al.|[2011 Ellis et al. 2013), 
WFC3/IR covers only AA ~ 560 and 390 A, (for z = 8 and z = 12, respectively, and ex- 
cluding the 1216A break itself). Including the less sensitive and lower resolution IRACl and 
IRAC2 channels only widens this to AA ~ 3700 and 2600 A for z = 8 and 12, respectively. 
Strong statistical priors exist for only a handful of lensed galaxies (e.g. Coe et al. 2013). 



The relative lack of information about high redshift galaxy candidates, as well as the low 
signal-to-noise of the observations produce a situation where using sound statistical methods 
is particularly important when trying to determine the redshifts of these objects. 

The latest claims of z > 8 galaxies based on x^ minimization SED fitting have been 



presented by Bouwens et al. (2011), Yan et al. (2010), and Ellis et al. (2013) using deep 



WFC3/IR images of the Hubble Deep Field. The results from Ellis et al. (2013) are particu- 
larly interesting since this project adds additional data (GO 12498, PI: Ellis, HUDF12) to the 
HUDF09 field (GO 11563, PI: Illingworth) in the F105W and F160W filters as well as new 
observations with the F140W filter. This filter spans a wavelength range inaccessible from 
the ground. Surprisingly since the fields are nearly identical, all but one of the sources listed 
in Bouwens et al. (2011) and none of the sources listed in Yan et al. (2010) were confirmed 



using these new deeper data. The new near-infrared measurements from the HUDF12 are 
the deepest observations available to date and for the foreseeable future. When combined 
with ancillary data from the HDUF09, these data allow one to construct spectral energy 
distributions of these sources spanning from observed i?-band (ACS F435W) to IRACl and 



IRAC2 channels (3.4 and 4.6 /xm, respectively). The 13 objects listed in Ellis et al. (2013) 
show large red colors consistent with strong spectral breaks. While foUowup spectroscopic 



observations have only been attempted on one of the targets, UDFy-38135539 (Lehnert et al. 



2010 Bunker et al.||2013 ) and will be discussed later, the objects have all been fit using what 
is now considered the "standard method" of minimized x^ SED fitting, yielding redshifts 
of 8 < z < 12. Given the limitations of this method, and the numerous examples of what 
later turned out to be incorrect redshift identifications, we are motivated to apply more 
robust Bayesian techniques to better constrain the physical parameters of galaxies observed 
using broad band filters. Central to this effort is the need to derive realistic estimates of the 
uncertainties for each derived parameter. 



Thus, using the sources and fluxes, including ACS, WFC3 and IRAC, provided in Ellis 



et al. (2013), and reference therein, for 13 candidate high-z 1216A break galaxies we employ 



ttMC , a Markov Chain Monte Carlo (MCMC) approach (for details see Pirzkal et al. 



2012a|[b ). For the purpose of this paper we define low redshift to be 2; < 4 and high redshift 



to be z > 4. Further, we treat all 13 objects listed in Ellis et al. (2013) as potential high 



redshift candidates to be tested with vrMC^. We are not implying that ttMC^ is the only 
viable method to perform this type of analysis. MCMC is a well established approach 
to this type of problem. The marginal signal-to-noise of current high redshift candidates 
observations warrant more robust techniques than the common y^ minimization technique 
and the reliance on a single "best fit" model provided by hyper-derived SED fitting software. 



2. Applied Techniques: Minimized ^ SED fitting vs. ttMC^ 

Nominally, the "standard method" of minimized x^ SED fitting works by matching pho- 
tometric fluxes to a pre-selected grid of model templates until the residuals between model 
and data are minimized. However, the choice of input models, the number of parameters 
(and whether any are constrained), parameter degeneracy and if the models sufficiently sam- 
ple the entire range of the physical parameter space all affect the outcome of the fit. Any 
and all of these issues can produce "best-fit" solutions that are unphysical (i.e. a very old 



object in a young Universe as in Labbe et al. (2010); Richard et al. (2011)). Once the num- 



ber of model parameters to be fit exceeds three it becomes computationally inefficient to 
fit parameter grid SEDs, more so for errors because each parameter must be refit following 
a pure Monte Carlo approach (e.g. each parameter requires several thousand additional 
iterations). All of the most recent B.ST based high-z candidate objects have relied on some 
modified implementation of the hyperz photometric code, or similar x^ SED minimization 



techniques (e.g. [McLure et aL]|2010| |20lH [Bouwens et aL]|2011t [Ellis et al.|[2013| ). While 
hyperz has proven to be capable for accurately constraining low-z photometric redshifts for 
large numbers of sources in deep, often crowded fields later confirmed with spectroscopic fol- 
lowup, it was designed to estimate the most /z/;;e/y photometric redshift, not produce credible 
intervals. Its treatment of the probability density functions (PDFs) is inadequate. Based on 
the hyperz Users Manual and Miralles (2013) it minimizes over the extra dimensions (T) 



instead of marginalizing over them, therefore the PDF returned by hyperz is given by: 



Ph- 



yperz 



(z) = minp(z, T) 



;i) 



This approach does not account for the volume of the parameter space probed resulting 
in unreliable confidence intervals, particularly the case of additional maxima. While such a 
distribution may be somewhat useful, the credible intervals produced by hyperz should not 



be trusted (Miralles 2013). 



In contrast, MCMC, and our implementation of it, ttMC^, is a random sampling method 
which samples the entire parameter space but does not sample the multidimensional region 



uniformly. The posterior probability distribution function can be constructed by simply 
creating a histogram of the values of a given parameter in the MCMC chain from values 



taken after the chain converges (see Section 2 in Pirzkal et al. (2012a) for more details). 
Just as in Pirzkal et al. (2012a) we quote both the 95% and 68% credible regions here (see 



Section 2.1 of Pirzkal et al. (2012a) for a discussion of 95% vs 68% credible regions, which 



we take to be the 95% and 68% Highest Posterior Density, or HPD, regions, respectively). 
While computationally more expensive (yet more efficient), our current implementation of 
the MCMC SED fitting code, ttMC^, has three main features: 

1) It does not depend on a pre-defined input model parameter grid and allows for a compu- 
tationally efficient exploration of the input parameter space. 

2) the effect of nebular emission lines and nebular emission continuum can be included. 

3) It allows one to derive a statistically valid posterior PDF for each of the input parameters 
by integrating over the remaining parameters: 



p{z) 



p{z,T)dT 



(2) 



To properly assess the possibility of lower-redshift solutions, it is imperative that the vol- 
ume of parameter space be taken into account by integrating over the additional degrees of 
freedom. 

We have applied vrMC^ to the sample of high redshift candidates listed in Ellis et al. 



(2013) because this sample is based on the deepest i7ST observations to date and provides 



candidate objects that could be the faintest examples of Lyman break galaxies. The anal- 



ysis of redshifts and other parameters using the photometric fiuxes from Ellis et al. (2013) 



are based upon using the use of stellar population models from Bruzual & Chariot (2003) 



(BC03), pmrlot and Bru'iuall ( |20l3l ) (CB07), and |Marasto"E| ( [2005| (MA05) in conjunction 
with ttMC^. The parameters investigated include: stellar mass; a broad range of extinction 
values (0 < Av < 10); metallicity (0.005 < Z < 0.05); and stellar population ages (limited 
only by the age of the Universe at a given redshift). We adopted fiat priors for these param- 
eters. The application of these models has also been tested with and without the effects of 
nebular continuum and emission lines (as parametrized by the escape fraction parameter). 
Details regarding the application of continuum nebular emission and emission lines to ttMC^ 



can be found in Section 2.2 of Pirzkal et al. (2012a). In addition to using single stellar 



population models (SSP), the BC03 models were also tested using an exponentially decaying 
star formation history model (parametrized by r). Thus, there are seven discrete population 
models used with vrMC^: 

1) A SSP BC03 model with nebular contribution; 2) A SSP BC03 model without nebular 
contribution; 3) A BC03 model with e"*/"^ star-formation history and without nebular con- 
tribution; 4) A SSP CB07 model with nebular contribution; 5) A SSP CB07 model without 



nebular contribution; 6) A SSP M05 model with nebular contribution; and 7) A SSP M05 
model without nebular contribution. 



3. Results 
3.1. Redshift Constraints 

A subset of the results of the ttMC^ analysis, obtained using the BC03 stellar population 
model with nebular emission, is presented in the first five columns of Table [ij For the most 
part, the redshift, extinction, stellar ages, stellar masses and metallically ranges are uniform 
among the seven input models we used with ttMC^. Any particular outliers are noted in 
Table 1. 

The results using the BC03 are representative of what is derived for nearly all of the 
models. Table [l] lists the 68% and the 95% credible intervals derived using vrMC^. Table 1 
clearly shows that the 95% credible regions do not strongly constrain the redshift ranges of 
most of the sources. It is also clear from Table 1 that the 95% credible intervals are much 
larger than the 68% credible intervals which means that redshift posterior PDFs are not 
Gaussian. 

A comparison of the quality of the fits (parameterized as the log likelihood computed 
by MCMC) from the most representative high redshift {z > 4) solution and that of the most 
representative low redshift {z < 4) solution, shows that high redshift models appear to fit the 
observations better than lower redshift models. As a first step, we can examine whether high 
redshift models always fit the data significantly better than alternative low redshift models 
by computing the significance p oi a likelihood ratio test. This is a comparison between the 
log likelihood of the best fitting (usually z > 4) and best fitting low-z (z < 4) models. If p 
> 0.05 then the hypothesis that the high-z model fits the observation better than the low-z 
model is rejected at the 2a level. 

While we used stellar population models both with and without the effect of nebular 
lines is significant, we find that when using BC03 models without nebular emission, 7 of the 
13 sources are reasonably fit by low redshift models. However, if nebular emission is included 
then 12 out of 13 objects are reasonably fit by low redshift models. The full range of values 
obtained from using different stellar population models are shown in Table [l| Clearly, the 
quality of the fits is not enough to confidently rule out these sources as low redshift inter- 
lopers. 

The MCMC methodology also allows us to compute the actual probability that a source 
is at 2 < 4 by integrating the posterior PDF of each object. The range of values across all 
seven stellar population models described above is shown in the last column of Table [Tj Low 



redshift solutions cannot be strongly excluded based on the observed photometric break for 
many of the objects listed in Table [T] When examining our results using all seven stellar 
population models, We find that 9 out of 13 objects cannot be ruled out as low redshift 
interlopers in the best case scenario (i.e. selecting the stellar population models that most 
favor high redshift solutions), and none out of 13 in the worst case scenario (i.e. selecting 
the stellar population models that most favor low redshift solutions). 

Low redshift {z < 4) solutions can, at best, be excluded at the 2a confidence level 
{P{z < 4) < 0.05) for only four sources (UDF12-3947-8076, UDF12-3954-6284, UDFy- 
3779600 and UDFy-38135539). For each of the seven stellar population, the average proba- 
bility for P{z < 4), averaged across all 13 sources, is remarkably consistent and ranges from 
19% to 26% with a global average of 21%. We conclude that ^ 21% of the sources in Table 
[T]are therefore likely to be low redshift interlopers. 

As in the case of earlier attempts to detect high-z systems (e.g. Henry et al. 2009[ ), it 
turns out that the limiting sensitivities of fluxes blueward of the presumptive break (i.e. ACS 
bands in our case) are not sufficient to provide strong constraints for the redshifts proposed 
by simple minimized x^ ^^^ "best fit" model SED fitting. Similarly, the IRAC observations 
are too shallow to provide enough constraints on the rest frame optical light from most of 
these objects. 

In the case of four sources for which low-z solutions can be statistically rejected, three 
(UDF12-3947-8076, UDFy-3779600 and UDFy-38135539) are y-band dropouts and the bright- 
est objects in the sample. Thus, relative to the ACS and IRAC detection limits, they have 
enough fiux in the WFC3 bands to produce a change in fiux strong enough to resemble a 
1216A decrement. The third object, UDF12-3954-6284, is very faint and only detected in 
one band. By definition it is a marginally acceptable candidate because it is only detected in 
one band (as noted by Ellis et al. 2013). It is possible that the observed photometric break 
could be caused by a strong emission line at low-z (see the Discussion section). 

The ACS and IRAC detection limits are too high to unambiguously identify a 1216A 
decrement for the remaining 9 objects in the sample. We estimate that the ACS and IRAC 
detection limits are ~ 5 and ~ 3 times too high (respectively) to distinguish between a 
1216A decrement and a Balmer break in sources this faint. This remains the main limiting 
factor in securely identifying sources ai z > 8.5 using WFC3 observations of the HUDF. 

In Figure [l] we show the SEDs of objects UDF12-3954-6284 and UDFy-37796000. The 
first example is meant to illustrate how too high of rest frame optical limits do not allow to 
reject low redshift solutions. The second example shows a clear detection of a strong pho- 
tometric break as well as constraining limits in both the rest frame UV and optical bands. 
In the case of UDF12-3954-6284, the IRAC detection limits are clearly too high to rule out 
that this object is a low redshift interloper, if we allow for nebular emission. The log likeli- 
hood of the high-z and low-z solutions for this object are 720.9 and 722.4 with a likelihood 



ratio test confidence of 0.08 wliicli indicates tliat tlie liigli redsliift model does not fit the 
observation significantly (i.e. 2a) better than the low redshift model. While models lacking 
nebular emission clearly favor a high redshift solution for this object, the break can clearly 
be reproduced by an emission line. 

A low redshift solution is much less likely in the case of the significantly brighter object 
UDFy-37796000 with a redshift of ~ 1.6 and Av= 2.8. The log likelihoods of the high and 
low redshift solutions are 722.6 and 717.5, respectively, resulting in a very low likelihood 
ratio test confidence values. This case demonstrates how strong rest-frame near UV and 
optical limits can help rule out a Balmer break. 



3.2. Secondary Parameters 

In addition to redshift constraints vrMC^ was used to determine other parameters in- 
cluding: stellar population ages; extinction; metallicity; and stellar mass. In the cases where 
models included nebular continuum, the escape fractions were determined; and in the case 
of BC03 with an exponentially decaying starburst, value of r were derived. The 95% and 
68% credible regions estimates for these parameters are given in Table [I] for each object. In 
the cases of three parameters: metallicity, escape fraction, and r the posterior PDFs were 



essentially flat. This confirms results from Pirzkal et al. (2012a) where it was demonstrated 



that these parameters can only be constrained using very high precision photometry (i.e. 
better than 1% level), which is not the case for the UDF12. 

The interdependence of the input model parameters is illustrated in Figure [2] which 
shows the two dimensional distribution of the posterior PDF's for UDF12-3895-7114. The 
statistically more likely regions are shown using proportionally lighter shades. This Figure 
illustrates the statistical complexity of fitting high redshift sources to stellar population mod- 
els, non Gaussian nature of many of the input model parameter PDF's, as well as the low 
redshift solutions which fit the observations. As suggested above, deeper ACS observations 
(an increase in sensitivity by a factor of 5 for the F850LP ACS filter) would make it possible 
to more confidently exclude many of the low redshift solutions. 



4. Discussion 

The low signal-to-noise and large errors associated with the deepest WFC3 observations 
to date requires a more robust analysis than common x^ minimization techniques can pro- 
vide to derive accurate credible regions for redshifts, especially in the faint target, highest 
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redshift regime. MCMC is able to provide just such an analysis. A complete SED analy- 
sis of the high redshift candidates discussed in this paper shows that there is a significant 
possibility (1 in 5) that these sources are low to moderate redshift (z<4) interlopers. This 



is in contrast to the results presented by Ellis et al. (2013), where they conclude that 7 out 



of 13 sources can be confidently classified as high-z galaxies. Examining our 68% redshift 



credible regions might get the impression that our results agree well with those of Ellis et 



al. (2013). However, the limited number of measurements as well as the low signal to noise 
of the available measurements cause the redshift PDF of these sources to be highly non 
gaussian. This in turns lead us to revise upward the probability that these sources are low 
redshift interlopers. In the end, we find that only the redshifts of the brightest y-dropouts 
in this sample are likely to be at a high redshift {z ^ 8). Although the 128 orbits of the 
UDF12 have added considerable photometric depth to the UDF, the sources listed in Table 
[l]are still extremely faint. Moreover, given the large uncertainties in both flux and position, 
current IRAC observations do not strongly constrain the redshift of the vast majority of 
these objects. Similarly, the ACS observations are of insufficient depth to strongly constrain 
the SED blue-ward of the (presumptive) Lyman (or possibly Balmer) break. Spectroscopic 
observations, while difficult and time consuming, still remain a viable method for further 
constraining the proposed photometric redshifts and have been attempted for two of the 
sources in Table [T] The first, UDF12-3954-6284, was observed using the WFC3 G141 grism 
(A = 1.075-1.7 fim with 6X = 0.00465 /im) as part of the 3D-HST survey (GO-12177; PI: van 
Dokkum) for two visits (4.7 ks each, the same orientation for each visit) and as a part of the 
CANDELS supernova follow-up program (GO-12099; PI: Riess) also for two visits (6.6 ks 



and 15 ks) each with a different orientation. Brammer et al. (2013) report a 2.7cr detection 



of an emission line at 1.599/xm based on smoothing and cross correlating the 2D spectrum 
with a kernel constructed from the central 0".3 of the F160Wtlmmbna.il image of UDF12- 
3954-6284 and removing any contamination using a contamination model (see their Figure 
2). The object is not detected in any single visit. If the emission line were from Lyman-a 



it would place the galaxy at z ~ 12, but Brammer et al. (2013) reject this possibility based 



not on the rest-frame equivalent width (~ 170 A) but because Lyman-a emission should be 
attenuated early in the epoch of re-ionization. Instead, they conclude the line is most likely 
[OIII], placing the object at z ~ 2.2. Note that, UDF12-3954-6284, which was detected in 
only one WFC3 broad band filter, is one of the three objects with the strongest probabilities 
of being at high redshift. 

The second object with follow-up spectroscopic observations is UDFy-38135539, all of 
which were obtained with ground-based instruments. It was first observed with SINFONI 
on the VLT (integration time of 14.8 hours), centered on the J-band (1.1-1.4 /xm) resulting 



in a reported 6cr detection of Lyman-a, placing the galaxy at z = 8.555 (Lehnert et al. 



2010). Subsequent observations using X-SHOOTER (0.3-2.5 fim) on the VLT and MOIRCS 



11 



(0.9-1.78 /iin) on Subaru (5 hours and 11 hours integration times, respectively), as well as 
re-analysis of the original SINFONI have disputed this claim based on non-detections of 



Lyman-a (Bunker et al. 2013). They further noted that the F105WWFC3 photometry is 



inconsistent with both the line flux claimed by Lehnert et al. (2010) and the redshift from 



Mclure et al. (2009). If the redshift were correct, then the FlOSWflux should be detected at 



15a. Evidently, the discrepancy in photometric measurements casts a doubt on our previous 
conclusion that this object was more unlikely to be a low redshift interloper. 

Using ttMC^ we find that there are two populations of low-z galaxies which fit the avail- 
able data as well as the high-z solutions. The first are low-moderate dusty systems at z ~ 2, 
the second are significantly dustier objects (up to Ay = 4) at z < 1. Distinguishing between 
these low-z solutions requires significantly more sensitive/deeper ACS and IRAC observa- 
tions (see Figure [I]) than are currently available (note: these would also serve to reject or 
improve the likelihood of high-z redshifts as well). Deeper IRAC observations, should any be 
undertaken in the remaining time of the Spitzer Warm Mission, must contend with reach- 
ing source confusion limits as well as the higher background noise from operating in "warm 
mode." Contrary to common wisdom and the conclusions reached by other high-z studies, 
deeper observations blue-ward (e.g F850LP or F8I4 W) of the presumptive break would be 
extremely useful, (see Figure [I]). Alternatively, one can seek sources that are brightened by 
lensing due to foreground galaxy clusters to get more flux from high-z sources (e.g 



Coe et 



al. 2013). However, only a limited volume can be probed using this method. 



Unfortunately, deeper observations in the optical bands using ACS are not likely con- 
sidering the prohibitive observing times that would be required. Shtless spectroscopic ob- 
servations using WFC3 with the G141 grism would provide an excellent method of rejecting 
or supporting the claims of high redshifts. However, this requires multiple position angles 
to avoid contamination from other objects or observational artifacts in the field as well as 



observations significantly deeper than those attempted to date (Pirzkal et al. 2012b). The 



G141 WFC3 grism observations would have the sensitivity to detect the emission lines po- 
tentially responsible for the observed photometric breaks. For example, in the case of object 
UDF12-3954-6284, an emission line with a flux as low as ~ 1.5 x 10"^*^ erg/s/A/cm^ could 
be solely responsible for a 29.3 magnitude detection in the F160W filter. Such a faint line 
would require upward of forty-two orbits to be significantly detected. In the absence of 
deeper HST or IRAC observations, observers will have to await the launch of JWST. We 
estimate that lOKs observations at O.dfim using NIRCAM with the JWST would be deep 
enough to confirm that the observed photometric breaks are strong enough to be 1216A 
breaks. Until such observations are made, the nature as well as the volume density of very 
high redshift candidates are likely to remain uncertain unless significantly deeper ACS or 
WFC3 observations are taken. At the very least, our analysis suggests that the uncertainties 
of current estimates of the volume density of these objects should be adjusted to account for 
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the possibility that nearly one in four of these objects is not a high redshift galaxy. 
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Fig. 1. — High and low redshift solutions to the observed photometry of objects UDF12-3954- 
6284 and UDFy-37796000. The upper left sub-panels show all of the available photometry, 
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(ACS and WFC3 bands) region for the same object. Two models are shown in each case. 
The first, shown in a dashed red line, is a z > 8 model while the solid blue line shows a 
much lower redshift solution at 2; ~ 2. Observed fluxes are shown using error bars. Synthetic 
model fluxes in each observed filter are shown using large circles. 
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Fig. 2. — Posterior probability density function estimates for object UDF12-3895-7114. The 
non-gaussian, multi-modal nature of the redshift distribution is evident in the rightmost 
plots. 
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